Abstract. We consider evaluating improper priors in a formal Bayes setting according
to the consequences of their use. Let $\Phi$ be a class of functions on the parameter
space and consider estimating elements of $\Phi$ under quadratic loss. If the formal Bayes
estimator of every function in $\Phi$ is admissible, then the prior is strongly admissible
with respect to $\Phi$. Eaton's method for establishing strong admissibility is based on
studying the stability properties of a particular Markov chain associated with the
inferential setting. In previous work, this was handled differently depending upon
whether $\varphi \in \Phi$ was bounded or unbounded. We introduce and study a new Markov
chain which allows us to unify and generalize existing approaches while simultaneously
broadening the scope of their potential applicability. To illustrate the method,
we establish strong admissibility conditions when the model is a $p$-dimensional
multivariate normal distribution with unknown mean vector $\theta$ and the prior is of the form

$\nu(\|\theta\|^2)\, d\theta$.



1. Introduction 

Suppose we are in a parametric setting, and we are considering use of an improper
prior measure that yields a proper posterior distribution. Such priors arise in the
absence of honest prior belief about parameter values and are typically derived from
structural arguments based on the likelihood or the parameter space [13]. Thus, an
improper prior, rather than being a statement of beliefs specific to a situation, is a default.
Such priors, proposed from likelihood or invariance arguments, require evaluation, just
as estimators proposed from likelihood or invariance arguments require evaluation, and
an attractive avenue is to evaluate the prior according to the consequences of its use.
That is, we can evaluate the prior by examining properties of the resulting posterior
inferences. The criterion we use to judge posterior inferences is known as strong
admissibility. This concept was introduced by Eaton [6] and has given rise to a substantial
theory. Our goal in the rest of this section is to convey the basic idea behind strong
admissibility and the way it is studied here. We also summarize our main results without
delving too far into the details, which are dealt with carefully later.

Suppose the sample space $\mathcal{X}$ is a Polish space with Borel $\sigma$-algebra $\mathcal{B}$ and the
parameter space $\vartheta$ is a Polish space with Borel $\sigma$-algebra $\mathcal{C}$. Let
$\{P(\cdot \mid \theta), \theta \in \vartheta\}$ be a family of sampling distributions where we assume that for
each $B \in \mathcal{B}$, $P(B \mid \cdot)$ is $\mathcal{C}$-measurable and for each $\theta \in \vartheta$, $P(\cdot \mid \theta)$ is a
probability measure on $(\mathcal{X}, \mathcal{B})$. Let $\nu$ be a $\sigma$-finite measure on the parameter
space $\vartheta$ with $\nu(\vartheta) = \infty$. Throughout, the marginal on $\mathcal{X}$

(1) $M(dx) := \int_{\vartheta} P(dx \mid \theta)\, \nu(d\theta)$

is assumed to be $\sigma$-finite. In this case, the disintegration

(2) $Q(d\theta \mid x)\, M(dx) = P(dx \mid \theta)\, \nu(d\theta)$

generalizes Bayes theorem and implicitly defines formal posterior distributions $Q(\cdot \mid x)$
on the parameter space. Note that for each $x \in \mathcal{X}$, $Q(\cdot \mid x)$ is a probability measure on
$(\vartheta, \mathcal{C})$ and for each $C \in \mathcal{C}$, $Q(C \mid \cdot)$ is $\mathcal{B}$-measurable. Taraldsen and Lindqvist [23] provide
a recent, accessible introduction to the existence of formal posterior distributions while
one can consult Eaton and Johnson for more details and references.



Suppose $\varphi : \vartheta \to \mathbb{R}^p$ for $p \ge 1$, and consider estimating $\varphi(\theta)$. The formal Bayes
estimator of $\varphi$ under squared error loss is the posterior mean

(3) $\hat\varphi(x) := \int \varphi(\theta)\, Q(d\theta \mid x)$.

Let $\|\cdot\|$ denote the usual Euclidean norm. If $\delta(x)$ is any estimator of $\varphi(\theta)$, the risk
function of $\delta$ is

(4) $R(\delta; \theta) := \int \|\varphi(\theta) - \delta(x)\|^2\, P(dx \mid \theta)$.

The estimator $\delta$ is almost-$\nu$ admissible if, for any other estimator $\delta_1$ such that
$R(\delta_1; \theta) \le R(\delta; \theta)$ for all $\theta \in \vartheta$, the set $\{\theta : R(\delta_1; \theta) < R(\delta; \theta)\}$ has $\nu$-measure 0.
Since we will use admissibility to judge the prior, our interests are more ambitious
than establishing admissibility of a single estimator. Let $\Phi$ be a class of functions
defined on the parameter space. If the formal Bayes estimator of every $\varphi \in \Phi$ is
almost-$\nu$ admissible, then we say the prior (equivalently the posterior) is strongly admissible
with respect to $\Phi$. A prior is strongly admissible if it is robust against risk dominance
within the class $\Phi$. Since a default prior will undergo repeated use, it is important
for the range of appropriate uses to be clearly defined and desirable that the range
be as large as possible. We can then endorse the improper prior insofar as it avoids
unreasonable actions in a variety of such problems.

Previous work on strong admissibility focused on the case where $\Phi$ consisted of a single
unbounded function, or all bounded functions. In either case, strong admissibility was
established either by verifying sufficient conditions for the admissibility of an estimator
established by Brown [4] or via Markov chain arguments using an approach developed by
Eaton [6, 8]. We study the latter method.

Eaton's method for establishing almost-$\nu$ admissibility of formal Bayes estimators is
based on the recurrence properties of a Markov chain associated with the inferential
setting; the relevant notion of recurrence is defined in the next section. However,
different Markov chains were required depending upon whether $\varphi$ was bounded [6] or
unbounded [8]. We introduce and study a new Markov chain which allows us to unify
and generalize these existing approaches while simultaneously broadening the scope of
their potential applicability. The expected posterior

(5) $R(d\theta \mid \eta) = \int_{\mathcal{X}} Q(d\theta \mid x)\, P(dx \mid \eta)$

is a Markov kernel on $\vartheta$. We study transformations of $R$, which are now described. Let
$f : \vartheta \times \vartheta \to \mathbb{R}_+$ satisfy $f(\theta, \eta) = f(\eta, \theta)$ and set

$\mathcal{T}(\eta) = \int f(\theta, \eta)\, R(d\theta \mid \eta)$.

If $0 < \mathcal{T}(\eta) < \infty$ for all $\eta \in \vartheta$, then

$T(d\theta \mid \eta) = \frac{f(\theta, \eta)}{\mathcal{T}(\eta)}\, R(d\theta \mid \eta)$

is a Markov kernel on $\vartheta$. Recurrence of the Markov chain associated with $T$ implies
the almost-$\nu$ admissibility of formal Bayes estimators with respect to a large class of
functions. Define

(6) $\Phi_f := \{\varphi : \|\varphi(\theta) - \varphi(\eta)\|^2 \le M_\varphi f(\theta, \eta) \text{ some } 0 < M_\varphi < \infty\}$.

We prove that if the Markov chain defined by $T$ is recurrent, then the formal Bayes
estimator of every function in $\Phi_f$ is almost-$\nu$ admissible, and we say the prior $\nu$ is
strongly admissible with respect to $\Phi_f$. The following example illustrates this technique.

Example 1. Let $X$ be a $p$-dimensional normal random variable with identity covariance
$I_p$ and unknown location $\theta$. Let $p$-dimensional Lebesgue measure $\nu(d\theta) = d\theta$ be our
improper prior. The proper posterior for $\theta$ is a normal with mean $x$, the observed
value of the random variable $X$, and covariance $I_p$. The kernel

$R(d\theta \mid \eta) = \int Q(d\theta \mid x)\, P(dx \mid \eta) = (4\pi)^{-p/2} \exp\left(-\tfrac{1}{4}\|\theta - \eta\|^2\right) d\theta$

describes a $N(\eta, 2I_p)$ random variable. Let $d$ be an arbitrary positive constant and note
that $f(\theta, \eta) = \|\theta - \eta\|^2 + d$ is symmetric and uniformly bounded away from 0. Further

$\mathcal{T}(\eta) = \int f(\theta, \eta)\, R(d\theta \mid \eta) = \int (\|\theta - \eta\|^2 + d)\, R(d\theta \mid \eta) = 2p + d$.

Thus,

$T(d\theta \mid \eta) = \frac{\|\theta - \eta\|^2 + d}{2p + d}\, R(d\theta \mid \eta)$

and the chain with kernel $T$ is a random walk on $\mathbb{R}^p$. Note that

$\int \|\theta - \eta\|^p\, T(d\theta \mid \eta) < \infty$

since a normal distribution has moments of all orders and hence for $p = 1$ or $p = 2$,
the chain is recurrent. We conclude that for $p = 1$ or $p = 2$ Lebesgue measure is
strongly admissible. That is, the formal Bayes estimators of all functions $\varphi$ satisfying

$\|\varphi(\theta) - \varphi(\eta)\|^2 \le M_\varphi (\|\theta - \eta\|^2 + d)$ for some $0 < M_\varphi < \infty$

are almost-$\nu$ admissible. Since $d$ is arbitrary this includes all bounded functions as well
as many unbounded functions.
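
The value $2p + d$ of the normalizing function in Example 1 can be checked by simulation. The following Python sketch is only an illustration and is not part of the original analysis; the function name `normalizing_constant_mc` and its arguments are hypothetical. It uses the fact stated above that $\theta - \eta \sim N(0, 2I_p)$ under $R(\cdot \mid \eta)$, so the estimate does not depend on $\eta$.

```python
import random


def normalizing_constant_mc(p, d, n_draws=200_000, seed=0):
    """Monte Carlo estimate of E[||theta - eta||^2 + d] when
    theta - eta ~ N(0, 2 I_p), i.e. of the normalizing function in Example 1.
    Hypothetical helper for illustration only."""
    rng = random.Random(seed)
    sd = 2.0 ** 0.5  # each coordinate of theta - eta has variance 2
    total = 0.0
    for _ in range(n_draws):
        sq_norm = sum(rng.gauss(0.0, sd) ** 2 for _ in range(p))
        total += sq_norm + d
    return total / n_draws


if __name__ == "__main__":
    p, d = 2, 1.0
    print("estimate:", normalizing_constant_mc(p, d), "exact:", 2 * p + d)
```

The estimate should agree with $2p + d$ up to Monte Carlo error, which shrinks at the usual $n^{-1/2}$ rate in the number of draws.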

Lebesgue measure is not strongly admissible with respect to $\Phi_f$ when $p \ge 3$ since the
James-Stein estimator dominates the formal Bayes estimator of $\varphi(\theta) = \theta$. In Section 4
we consider the normal means problem with an alternative prior when $p \ge 3$. We will
return to this example below.
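
The dominance that rules out Lebesgue measure for $p \ge 3$ can be seen in a small simulation. The sketch below is an illustration under stated assumptions, not the authors' argument: the function name `risks_at` is hypothetical, the James-Stein estimator is taken in its standard form $(1 - (p-2)/\|x\|^2)\,x$, and risks are approximated by Monte Carlo averages of squared error loss. The formal Bayes estimator under Lebesgue measure is $\delta(x) = x$, whose risk is $p$ for every $\theta$; at $\theta = 0$ the James-Stein risk is exactly 2.

```python
import random


def risks_at(theta, n_draws=50_000, seed=0):
    """Monte Carlo risks under squared error loss, for X ~ N_p(theta, I_p), of
    delta(x) = x and of the James-Stein estimator (1 - (p - 2)/||x||^2) x.
    Hypothetical helper for illustration only; requires len(theta) >= 3."""
    p = len(theta)
    rng = random.Random(seed)
    loss_x = 0.0
    loss_js = 0.0
    for _ in range(n_draws):
        x = [t + rng.gauss(0.0, 1.0) for t in theta]
        sq_norm = sum(v * v for v in x)
        shrink = 1.0 - (p - 2) / sq_norm
        loss_x += sum((v - t) ** 2 for v, t in zip(x, theta))
        loss_js += sum((shrink * v - t) ** 2 for v, t in zip(x, theta))
    return loss_x / n_draws, loss_js / n_draws


if __name__ == "__main__":
    risk_x, risk_js = risks_at([0.0] * 10)
    print("risk of x:", risk_x, "risk of James-Stein:", risk_js)
```

The example uses $p = 10$ rather than $p = 3$ only so that the simulated loss has finite variance and the averages stabilize quickly.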

Recurrence of the Markov chain described by $T$ implies more than we have so far
claimed. Suppose the function $u : \vartheta \to \mathbb{R}_+$ is bounded away from zero and infinity so
that $1/c < u(\theta) < c$ for some constant $c > 1$ and every $\theta \in \vartheta$. The measure defined by

(7) $\nu_u(d\theta) := u(\theta)\, \nu(d\theta)$

is a bounded perturbation of $\nu$. Let $\mathcal{F}_\nu$ be the family of all bounded perturbations of $\nu$.
Observe that $\nu \in \mathcal{F}_\nu$ and that the other elements of $\mathcal{F}_\nu$ are measures with tail behavior
similar to $\nu$. Eaton [6] showed that recurrence of the chain with kernel $R$ implies that the
formal Bayes estimator of any bounded function is almost-$\nu$ admissible for every prior
in $\mathcal{F}_\nu$. We extend this result and show that recurrence under $T$ is sufficient for the
strong admissibility with respect to $\Phi_f$ of every element of $\mathcal{F}_\nu$. In this case, we say the
family $\mathcal{F}_\nu$ is strongly admissible with respect to $\Phi_f$.

Example 2. Recall the setting of Example 1. Let $\mathcal{F}_\nu$ be the bounded perturbations of
$p$-dimensional Lebesgue measure $\nu(d\theta) = d\theta$. That is, elements of $\mathcal{F}_\nu$ are measures of
the form $\nu_u(d\theta) = u(\theta)\, d\theta$ where $1/c < u(\theta) < c$ for some constant $c > 1$. Recall that
$\Phi_f$ is the class of all bounded functions and all functions satisfying

$\|\varphi(\theta) - \varphi(\eta)\|^2 \le M_\varphi (\|\theta - \eta\|^2 + d)$ for some $0 < M_\varphi < \infty$.

The recurrence of the Markov chain governed by $T$ when $p = 1$ or $p = 2$ implies
the formal Bayes estimators of every $\varphi \in \Phi_f$ are almost-$\nu$ admissible for every prior
$\nu_u \in \mathcal{F}_\nu$. That is, $\mathcal{F}_\nu$ is strongly admissible with respect to $\Phi_f$.

Our main results generalize existing work [6, 8] and in fact unify the analysis for
bounded and unbounded functions. We show that the Markov kernel $R$ can be
transformed to define many Markov chains, any one of which might be used to demonstrate
strong admissibility, thus greatly broadening the scope of potential applications. We are
never concerned narrowly with a single admissibility problem but broadly with species
of problems. Moreover, solving a single representative problem, which representative
we are free to elect, solves any problem within a bounded rate of change, whether of
the function to be estimated or the prior used to estimate it.

The remainder is organized as follows. Section 2 gives some background on recurrence
for general state space Markov chains. Section 3 presents the main results and Section 4
illustrates the main results by considering the multivariate normal means problem.
Finally, many technical details are deferred to the appendices.

2. Recurrence of Markov chains 

The goal of this section is to introduce a general notion of recurrence for Markov
chains. Let $\mathsf{W}$ be a Polish space and denote the Borel $\sigma$-algebra by $\mathcal{D}$. Let
$K : \mathcal{D} \times \mathsf{W} \to [0, 1]$. Then $K$ is a Markov transition kernel on the measurable space
$(\mathsf{W}, \mathcal{D})$ if $K(D \mid \cdot)$ is a nonnegative measurable function for every $D \in \mathcal{D}$ and $K(\cdot \mid w)$ is
a probability measure for every $w \in \mathsf{W}$.

The kernel determines a time-homogeneous Markov chain $W = \{W_0, W_1, W_2, \ldots\}$
on the product space $\mathsf{W}^\infty$ which is equipped with the product $\sigma$-algebra $\mathcal{D}^\infty$. Note
that conditional on $W_n = w$, the law of $W_{n+1}$ is $K(\cdot \mid w)$. Given $W_0 = w$, let $\Pr(\cdot \mid w)$ be
the law of $W$ on $\mathsf{W}^\infty$.

Suppose $D \in \mathcal{D}$. The random variable

$\tau_D = \begin{cases} \infty & \text{if } W_n \notin D \text{ for all } n \ge 1, \text{ or} \\ \text{the smallest } n \ge 1 \text{ such that } W_n \in D & \text{otherwise} \end{cases}$

is a stopping time for $D$, and $E_D = \{\tau_D < \infty\}$ is the set of paths that encounter $D$
after initialization. Let $\xi$ be a non-trivial, $\sigma$-finite measure on $(\mathsf{W}, \mathcal{D})$ and recall that a
set is $\xi$-proper if its measure under $\xi$ is positive and finite.

Definition 2.1. A $\xi$-proper set $D$ is locally $\xi$-recurrent if $\Pr(E_D \mid w) = 1$ for all but a
$\xi$-null set of initial values in $D$. Call the Markov chain $W$ locally $\xi$-recurrent if every
$\xi$-proper set is locally $\xi$-recurrent.

This notion of recurrence is more general than that typically encountered in general
state space Markov chain theory [19], but is appropriate since the chains we will
consider in the next section may not be irreducible [10, 11]. We consider a method for
establishing local recurrence in Section 3.2.

The connection between the above general Markov chain theory and the notion of
strong admissibility relies heavily on the special structure of symmetric Markov chains.
Let $\xi$ be a non-trivial, $\sigma$-finite measure. Then the kernel $K$ is $\xi$-symmetric if the
measure

$\omega(A, B) := \int_A K(B \mid w)\, \xi(dw)$

satisfies $\omega(A, B) = \omega(B, A)$ for all $A, B \in \mathcal{D}$. Throughout the remainder we restrict
attention to symmetric Markov kernels. Eaton provides some background on the
theory of symmetric Markov chains underlying strong admissibility.

3. Strong Admissibility via Markov chains 



In a ground-breaking paper Eaton [6] connected the local recurrence of a Markov
chain and the almost-$\nu$ admissibility of the formal Bayes estimator for bounded $\varphi$.
Later, Eaton [8] showed that the local recurrence of a different Markov chain was
required to establish almost-$\nu$ admissibility when $\varphi$ is unbounded. Since our work
builds on them, these foundational definitions and results are stated carefully here.
The expected posterior distribution at the parameter value $\eta$, $R(d\theta \mid \eta)$ defined in (5),
is a $\nu$-symmetric Markov kernel on $\vartheta$ since it satisfies the detailed balance condition

(8) $R(d\theta \mid \eta)\, \nu(d\eta) = R(d\eta \mid \theta)\, \nu(d\theta)$.

Call $R$ an Eaton kernel; analogous kernels exist for the sample space [12] and the
product of the sample and parameter spaces [10]. Eaton [6] established the following basic
result.

Theorem 3.1. If the Markov chain with kernel $R$ is locally $\nu$-recurrent, then the formal
Bayes estimator of every bounded function is almost-$\nu$ admissible.

Eaton [6] also showed that the result holds for bounded perturbations of $\nu$.
Theorem 3.1 has found substantial application [11, 12, 13, 18].

Now suppose we want to use Markov chains to study formal Bayes estimators of
unbounded functions on $\vartheta$. We need a basic assumption on the risk function $R$, defined
at (4), to ensure existence of the integrated risk difference which is studied in the
appendices.

Assumption 3.1. Suppose there exist sets $K_1 \subset K_2 \subset \cdots$ with $\cup K_i = \vartheta$. Also assume
that for all $i$, $0 < \nu(K_i) < \infty$ and

$\int_{K_i} R(\hat\varphi; \theta)\, \nu(d\theta) < \infty$.

Let $\varphi : \vartheta \to \mathbb{R}^p$ be measurable and unbounded and set

$\mathcal{S}(\eta) = \int \|\varphi(\theta) - \varphi(\eta)\|^2\, R(d\theta \mid \eta)$

so that if $0 < \mathcal{S}(\eta) < \infty$ for all $\eta$, then

$S(d\theta \mid \eta) = \frac{\|\varphi(\theta) - \varphi(\eta)\|^2}{\mathcal{S}(\eta)}\, R(d\theta \mid \eta)$

is a Markov kernel defining a Markov chain on $\vartheta$. Moreover $S$ satisfies detailed balance
with respect to $\mathcal{S}(\eta)\, \nu(d\eta)$:

$S(d\theta \mid \eta)\, \mathcal{S}(\eta)\, \nu(d\eta) = S(d\eta \mid \theta)\, \mathcal{S}(\theta)\, \nu(d\theta)$.

Eaton [8] connected the local recurrence of $S$ with the almost-$\nu$ admissibility of the
formal Bayes estimator of $\varphi(\theta)$.

Theorem 3.2. Suppose Assumption 3.1 holds and assume $0 < \mathcal{S}(\eta) < \infty$ for each
$\eta \in \vartheta$ and there exist $\nu$-proper sets $K_1 \subset K_2 \subset \cdots$ such that $\cup K_i = \vartheta$ and for each $i$

$\int_{K_i} \mathcal{S}(\eta)\, \nu(d\eta) < \infty$.

If the Markov chain with kernel $S$ is locally $\nu$-recurrent, then the formal Bayes estimator
of $\varphi$ is almost-$\nu$ admissible.

In the next section, we unify and generalize these results. We show that by analyzing 
an appropriate Markov chain we can recover the conclusions of both theorems and, in 
fact, achieve something stronger. Moreover, we broaden the class of Markov chains that 
can be studied to obtain strong admissibility results. 

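3.1. A new Markov chain connection. For any $\eta \in \vartheta$, let $\psi(\cdot \mid \eta)$ be a nontrivial
$\sigma$-finite measure on $(\vartheta, \mathcal{C})$ such that $\psi(\cdot \mid \eta)$ is absolutely continuous with respect to
$R(\cdot \mid \eta)$. For any element $C$ of the Borel sets $\mathcal{C}$, let $\psi(C \mid \cdot)$ be a nonnegative measurable
function. Let $f(\cdot, \eta)$ be a Radon-Nikodym derivative of $\psi(\cdot \mid \eta)$ with respect to $R(\cdot \mid \eta)$,
that is,

(10) $\psi(C \mid \eta) = \int_C f(\theta, \eta)\, R(d\theta \mid \eta)$ for all $(\eta, C) \in (\vartheta, \mathcal{C})$.

Let $\mathcal{T}(\eta) = \psi(\vartheta \mid \eta)$ for all $\eta \in \vartheta$ and define

(11) $T(C \mid \eta) = \frac{\psi(C \mid \eta)}{\mathcal{T}(\eta)}$ for all $(\eta, C) \in (\vartheta, \mathcal{C})$.

We will make the following basic assumptions on $\mathcal{T}$.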

Assumption 3.2. For all $\eta \in \vartheta$ we have $0 < \mathcal{T}(\eta) < \infty$ and there exist $\nu$-proper sets
$K_1 \subset K_2 \subset \cdots$ such that $\cup K_i = \vartheta$, and for each $i$

$\int_{K_i} \mathcal{T}(\eta)\, \nu(d\eta) < \infty$.

The first part of the assumption ensures that the kernel $T$ is well-defined. If $\mu(d\eta) =
\mathcal{T}(\eta)\, \nu(d\eta)$, then the second part of the assumption implies that $\mu$ is $\sigma$-finite. In
Proposition B.1 we establish that

$T(d\theta \mid \eta)\, \mu(d\eta) = f(\theta, \eta)\, R(d\theta \mid \eta)\, \nu(d\eta)$.

If $f(\theta, \eta) = f(\eta, \theta)$ for all $\eta$ and $\theta$ in $\vartheta$, then by using (8) it is easy to see that $T$ is
symmetric with respect to $\mu$.
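
On a finite state space the symmetry claim can be verified directly. The following Python sketch is an illustration under assumptions that are not in the text: the state space is $\{0, \ldots, n-1\}$, a symmetric nonnegative matrix `weights` with positive row sums stands in for $R(d\theta \mid \eta)\, \nu(d\eta)$, and the names `transformed_kernel` and `is_symmetric_wrt` are hypothetical.

```python
def transformed_kernel(weights, f):
    """Finite-state analogue of the construction in Section 3.1.

    weights[i][j] = weights[j][i] >= 0 plays the role of R(d theta|eta) nu(d eta),
    and f[i][j] = f[j][i] > 0 is the symmetric transforming function.
    Returns (nu, R, T, mu) with R[i][j] the probability of moving from i to j.
    Hypothetical helper for illustration only."""
    n = len(weights)
    nu = [sum(row) for row in weights]                       # reference measure
    R = [[weights[i][j] / nu[i] for j in range(n)] for i in range(n)]
    norm = [sum(f[i][j] * R[i][j] for j in range(n)) for i in range(n)]
    T = [[f[i][j] * R[i][j] / norm[i] for j in range(n)] for i in range(n)]
    mu = [norm[i] * nu[i] for i in range(n)]                 # mu = norm * nu
    return nu, R, T, mu


def is_symmetric_wrt(kernel, measure, tol=1e-9):
    """Check detailed balance: measure[i] * kernel[i][j] == measure[j] * kernel[j][i]."""
    n = len(kernel)
    return all(
        abs(measure[i] * kernel[i][j] - measure[j] * kernel[j][i]) <= tol
        for i in range(n)
        for j in range(n)
    )


if __name__ == "__main__":
    weights = [[1.0, 2.0, 0.0], [2.0, 1.0, 3.0], [0.0, 3.0, 4.0]]
    f = [[(i - j) ** 2 + 1.0 for j in range(3)] for i in range(3)]
    nu, R, T, mu = transformed_kernel(weights, f)
    print(is_symmetric_wrt(R, nu), is_symmetric_wrt(T, mu))
```

In this discrete setting the product `mu[i] * T[i][j]` reduces to `f[i][j] * weights[i][j]`, which is symmetric in `i` and `j`; this is the finite-state counterpart of the displayed identity.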

Note that $\mathcal{T}(\eta)$ is almost-$\nu$ uniformly bounded away from 0 if there exists $\epsilon > 0$ such
that $\mathcal{T}(\eta) > \epsilon$ except possibly on a set of $\nu$-measure 0. We are now in a position to
state the main result. The proof is given in Appendix B. Recall the definition of $\Phi_f$
from (6).

Theorem 3.3. Suppose Assumptions 3.1 and 3.2 hold. Let $\mathcal{T}(\eta)$ be almost-$\nu$ uniformly
bounded away from zero and suppose that for all $\eta$ and $\theta$ in $\vartheta$

(12) $f(\theta, \eta) = f(\eta, \theta)$.

If the Markov chain having kernel $T(d\theta \mid \eta)$ is locally $\nu$-recurrent, then $\nu$ is strongly
admissible with respect to $\Phi_f$.

The following extension of Theorem 3.3 is also proved in Appendix B.

Theorem 3.4. Assume the conditions of Theorem 3.3. Then every bounded perturbation
$\nu_u$ is strongly admissible, that is, the family $\mathcal{F}_\nu$ is strongly admissible with respect
to $\Phi_f$.

If we take $f(\theta, \eta) = d$ for some $d > 0$, then we completely recover the results of
Theorem 3.1 while if $f(\theta, \eta) = \|\varphi(\theta) - \varphi(\eta)\|^2$, then we extend the results of Theorem 3.2. If,
as in Examples 1 and 2, we set $f(\theta, \eta) = \|\varphi(\theta) - \varphi(\eta)\|^2 + d$ with $d > 0$, then we obtain
results stronger than if we had established local recurrence of the chains associated with
the kernels $R$ and $S$ and relied on Theorems 3.1 and 3.2. Moreover, since the analyst
has the freedom to choose an appropriate $f$, this result extends the range of potential
applicability of Eaton's method.

3.2. Reducing dimension. The Markov kernel $T$ naturally takes the same dimension
as the parameter space. This dimension may be quite large, making the required
analysis difficult. In this section we prove that the conclusions of Theorem 3.4 (hence
Theorem 3.3) hold if we can establish the local recurrence of a particular Markov chain
which lives on $[0, \infty)$.

Denote the Borel subsets of $[0, \infty)$ by $\mathcal{A}$. A measurable mapping $t$ from $(\vartheta, \mathcal{C})$ to
$([0, \infty), \mathcal{A})$ induces a measure $\tilde\nu$ given by

$\tilde\nu(A) = \nu(t^{-1}(A)), \quad A \in \mathcal{A}$.



We will need the next assumption throughout the remainder of this section. 



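Assumption 3.3. There exists a partition $\{A_i\}$ of $[0, \infty)$ such that each $A_i$ is
measurable and each $C_i = t^{-1}(A_i)$ is $\nu$-proper.

Eaton et alia [11] showed that under Assumption 3.3 there exists a Markov transition
function $\pi(d\theta \mid \beta)$ on $\mathcal{C} \times [0, \infty)$ such that

(13) $\nu(d\theta) = \pi(d\theta \mid \beta)\, \tilde\nu(d\beta)$

which means for all measurable nonnegative functions $f_1$ on $\vartheta$ and $f_2$ on $[0, \infty)$

$\int_{\vartheta} f_2(t(\theta))\, f_1(\theta)\, \nu(d\theta) = \int_0^\infty f_2(\beta) \left( \int_{\vartheta} f_1(\theta)\, \pi(d\theta \mid \beta) \right) \tilde\nu(d\beta)$.

Define

(14) $\tilde P(dx \mid \beta) = \int_{\vartheta} P(dx \mid \theta)\, \pi(d\theta \mid \beta), \quad \beta \in [0, \infty)$.

The conditional probabilities $\tilde P(\cdot \mid \beta)$ form a parametric family indexed by $\beta$, and $\tilde\nu(d\beta)$
is a $\sigma$-finite prior. Eaton et alia [11] also showed that the marginal measure on $\mathcal{X}$ is
the same as at (1). That is,

$\int_0^\infty \tilde P(dx \mid \beta)\, \tilde\nu(d\beta) = M(dx)$,

which is assumed $\sigma$-finite. Thus, there is a Markov kernel $\tilde Q(d\beta \mid x)$ satisfying

(15) $\tilde Q(d\beta \mid x)\, M(dx) = \tilde P(dx \mid \beta)\, \tilde\nu(d\beta)$.

In fact, a version of the posterior [11] is

(16) $\tilde Q(A \mid x) := Q(t^{-1}(A) \mid x)$ for all $A \in \mathcal{A}$.

The expected posterior

(17) $\tilde R(d\beta \mid \alpha) = \int_{\mathcal{X}} \tilde Q(d\beta \mid x)\, \tilde P(dx \mid \alpha)$

is a $\tilde\nu$-symmetric Eaton kernel.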

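If $\eta$ and $\theta$ are elements of $\vartheta$ let $\alpha = t(\eta)$ and $\beta = t(\theta)$. For any $\alpha \in [0, \infty)$, let
$\tilde\psi(\cdot \mid \alpha)$ be a nontrivial $\sigma$-finite measure on $([0, \infty), \mathcal{A})$ such that $\tilde\psi(\cdot \mid \alpha)$ is absolutely
continuous with respect to $\tilde R(\cdot \mid \alpha)$. Let $\tilde f(\cdot, \alpha)$ be a Radon-Nikodym derivative of $\tilde\psi(\cdot \mid \alpha)$
with respect to $\tilde R(\cdot \mid \alpha)$, i.e., for all $(\alpha, A) \in ([0, \infty), \mathcal{A})$

$\tilde\psi(A \mid \alpha) = \int_A \tilde f(\beta, \alpha)\, \tilde R(d\beta \mid \alpha)$.

Define $\tilde{\mathcal{T}}(\alpha) = \tilde\psi([0, \infty) \mid \alpha)$ and set

$\tilde T(A \mid \alpha) = \frac{\tilde\psi(A \mid \alpha)}{\tilde{\mathcal{T}}(\alpha)}, \quad (\alpha, A) \in ([0, \infty), \mathcal{A})$

and let $\tilde\mu(d\alpha) = \tilde{\mathcal{T}}(\alpha)\, \tilde\nu(d\alpha)$. The following assumption ensures that $\tilde T$ is a well-defined
kernel and that $\tilde\mu$ is $\sigma$-finite.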

Assumption 3.4. For all $\alpha$ we have $0 < \tilde{\mathcal{T}}(\alpha) < \infty$ and there exist $\tilde\nu$-proper sets
$K_1 \subset K_2 \subset \cdots$ such that $\cup K_i = [0, \infty)$, and for each $i$

$\int_{K_i} \tilde{\mathcal{T}}(\alpha)\, \tilde\nu(d\alpha) < \infty$.

In Appendix C we show that $\tilde T$ is $\tilde\mu$-symmetric. The following theorem shows that
we can analyze the recurrence properties of the chain defined by $\tilde T$ to achieve the
conclusions of Theorem 3.4 (hence Theorem 3.3). The proof is given in Appendix C
while use of the result is illustrated in Section 4. Recall that $\mathcal{F}_\nu$ is the family of bounded
perturbations of $\nu$.



Theorem 3.5. Suppose Assumption 3.4 holds. Let $\tilde{\mathcal{T}}(\cdot)$ be almost-$\tilde\nu$ uniformly bounded
away from 0 and suppose that for all $\alpha$ and $\beta$ in $[0, \infty)$

$\tilde f(\beta, \alpha) = \tilde f(\alpha, \beta)$.

If the Markov chain with kernel $\tilde T$ is locally $\tilde\nu$-recurrent, then $\mathcal{F}_\nu$ is strongly admissible
with respect to $\Phi_{\tilde f}$.

Since $\tilde T$ lives on $[0, \infty)$ it would be convenient to have conditions which guarantee
the local recurrence of a Markov chain on $[0, \infty)$. This is discussed in the following
section.

3.2.1. Recurrence of Markov chains on $[0, \infty)$. To this point we have said little about
establishing local recurrence. The following theorem presents one method for doing so
and is a distillation of several existing results. It applies generally to Markov
chains on $[0, \infty)$ and hence the notation in this section is consistent with that of
Section 2.

Let $\mathsf{W} = [0, \infty)$ and $\mathcal{D}$ be the Borel $\sigma$-algebra. Let $K : \mathcal{D} \times \mathsf{W} \to [0, 1]$ be a Markov
kernel which defines a time-homogeneous Markov chain $W = \{W_0, W_1, W_2, \ldots\}$ on $\mathsf{W}^\infty$.
Define the $k$th moment of $K$ about its current state as

(18) $m_k(v) = \int_0^\infty (x - v)^k\, K(dx \mid v)$.

Theorem 3.6. Assume for each positive integer $n$ there exists $\delta(n) < 1$ such that

(19) $\sup_{v \in [0, n]} K([0, n] \mid v) \le \delta(n)$.

Suppose

(20) $\lim_{v \to \infty} \frac{\log v}{v}\, \frac{m_3(v)}{m_2(v)} = 0$

and there exists a function $\phi$ and an integer $n_0$ such that for $v \in [n_0, \infty)$

(21) $m_1(v) \le \frac{m_2(v)}{2v}\, [1 + \phi(v)]$ and $\lim_{v \to \infty} \phi(v) \log v = 0$.

If $\xi$ is a non-trivial, $\sigma$-finite measure, $0 < \xi([0, n)) < \infty$ and $K$ is $\xi$-symmetric, then
the Markov chain $W$ is locally $\xi$-recurrent.

We will use Theorem 3.6 in conjunction with Theorem 3.5 in our main application
in Section 4.
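
The moment conditions (20) and (21) can be explored numerically before attempting a proof. The Python sketch below is an illustration only: the function name `moment_condition_values` is hypothetical, and the toy moment functions in the usage example are assumptions chosen for simplicity, not the moments of any chain studied here. Evaluating the quantities at a few large states can suggest whether the limits are plausible, but it does not establish them.

```python
import math


def moment_condition_values(m1, m2, m3, phi, v):
    """Evaluate the quantities appearing in conditions (20) and (21) at a state v > 1.

    m1, m2, m3 are callables returning the first three moments of the kernel
    about the current state v; phi is the candidate function in (21).
    Hypothetical helper for illustration only."""
    cond20_value = (math.log(v) / v) * (m3(v) / m2(v))
    bound_21 = m2(v) / (2.0 * v) * (1.0 + phi(v))
    return {
        "cond20_value": cond20_value,       # (20) asks that this tend to 0
        "cond21_slack": bound_21 - m1(v),   # (21) asks that this be >= 0 eventually
        "phi_log": phi(v) * math.log(v),    # (21) asks that this tend to 0
    }


if __name__ == "__main__":
    # Toy moments, assumed for illustration: m1 = 4, m2 = 8v, m3 = 8v, phi = v^(-1/2).
    for v in (1e2, 1e4, 1e6):
        print(v, moment_condition_values(
            lambda s: 4.0, lambda s: 8.0 * s, lambda s: 8.0 * s,
            lambda s: s ** -0.5, v))
```

For the toy moments the first and third quantities shrink as `v` grows while the slack stays positive, which is the qualitative pattern the theorem requires.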



4. Admissible Priors for the Multivariate Normal Mean 

Let $X \sim N_p(\theta, I_p)$ with $p \ge 1$. Consider the family of $\sigma$-finite measures on $\vartheta = \mathbb{R}^p$
described by

(22) $\left( \frac{1}{a + \|\theta\|^2} \right)^{b} d\theta, \quad a \ge 0, \ b \ge 0$.

For $a = 0$, the prior is improper for all $b > 0$, but the induced marginal distributions on
$\mathcal{X}$ are only $\sigma$-finite for $b < p/2$. For $a > 0$, the family yields improper prior distributions
when $b \le p/2$ and proper prior distributions when $b > p/2$. In fact, if $b = (a + p)/2$,
the prior is the kernel of a multivariate $t$ distribution with $a$ degrees of freedom. When
$a > 0$ and $b = 0$, this is $p$-dimensional Lebesgue measure which was considered in
Example 1.

Now suppose $p \ge 3$. Berger, Strawderman and Tang [3] established that the formal
Bayes estimator of $\theta$ is admissible when $a = 1$ and $b = (p - 1)/2$ while Eaton et alia
[11] use Theorem 3.1 to prove that if $a > 0$ and $b \in [p/2 - 1, p/2)$, then the formal
Bayes estimator of every bounded function is almost-$\nu$ admissible. We use Theorems 3.5
and 3.6 to add to these results. Let $d > 0$ be arbitrary and define

$f(\theta, \eta) = \|\theta - \eta\|^2 + d, \quad \theta, \eta \in \mathbb{R}^p$

so that

$\Phi_f = \{\varphi : \|\varphi(\theta) - \varphi(\eta)\|^2 \le M_\varphi f(\theta, \eta) \text{ some } 0 < M_\varphi < \infty\}$.

Theorem 4.1. For $p \ge 3$ let $X \sim N_p(\theta, I_p)$ and with $a > 0$ set

$\nu(d\theta) = \left( \frac{1}{a + \|\theta\|^2} \right)^{p/2} d\theta$.

Then $\mathcal{F}_\nu$, the family of bounded perturbations of $\nu$, is strongly admissible with respect
to $\Phi_f$.

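Proof of Theorem 4.1. Let $\gamma_a(z) = (a + z)^{-p/2}$, so that the prior, a member of the
family (22), can be expressed as $\gamma_a(\|\theta\|^2)\, d\theta$. The function $t(\theta) = \|\theta\|^2$ fulfills the
requirements of Assumption 3.3 as can be seen by letting the sets $A_i = [i - 1, i)$ partition the
nonnegative real numbers. Letting $\pi(d\theta \mid \beta)$ denote the uniform distribution on the
hypersphere of radius $\sqrt{\beta}$, it can be shown that $\nu(d\theta) = \pi(d\theta \mid \beta)\, \tilde\nu(d\beta)$ with

$\tilde\nu(d\beta) = \frac{[\Gamma(1/2)]^p}{\Gamma(p/2)}\, \gamma_a(\beta)\, \beta^{p/2 - 1}\, d\beta$

on $[0, \infty)$. Let $\pi_1$ be the uniform distribution on the unit hypersphere $S$. Then the
reduced sampling distribution has density

$\tilde p(x \mid \beta) = \int_S (2\pi)^{-p/2} \exp\left( -\tfrac{1}{2} \|x - \zeta \sqrt{\beta}\|^2 \right) \pi_1(d\zeta)$

with respect to Lebesgue measure on $\mathcal{X}$. If

$m(x) = \int_0^\infty \tilde p(x \mid \beta)\, \tilde\nu(d\beta)$,

then the formal posterior has density

$\tilde q(\beta \mid x) = \frac{[\Gamma(1/2)]^p\, \tilde p(x \mid \beta)\, \gamma_a(\beta)\, \beta^{p/2 - 1}}{\Gamma(p/2)\, m(x)}$

with respect to Lebesgue measure on $[0, \infty)$. The expected posterior

(23) $\tilde R(d\beta \mid \alpha) = \int_{\mathcal{X}} \tilde q(\beta \mid x)\, \tilde p(x \mid \alpha)\, dx\, d\beta$

is a $\tilde\nu$-symmetric Markov transition kernel on $[0, \infty)$.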

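Let $\tilde f(\beta, \alpha) = 2(\beta + \alpha + c)$ with $c$ a positive constant. It is clear that $\tilde f$ is symmetric
in $\alpha$ and $\beta$ and bounded away from zero. For any nonnegative real number $\alpha$,

$\tilde{\mathcal{T}}(\alpha) = \int \tilde f(\beta, \alpha)\, \tilde R(d\beta \mid \alpha) = 2 \int \beta\, \tilde R(d\beta \mid \alpha) + 2\alpha + 2c$

is greater than or equal to $c$. Thus the kernel

(24) $\tilde T(d\beta \mid \alpha) = \frac{(\beta + \alpha + c)\, \tilde R(d\beta \mid \alpha)}{\int (y + \alpha + c)\, \tilde R(dy \mid \alpha)}$

defines a $\tilde\mu$-symmetric Markov chain on $[0, \infty)$ where

$\tilde\mu(A) = \int_A \tilde{\mathcal{T}}(\alpha)\, \tilde\nu(d\alpha), \quad A \in \mathcal{A}$.

Clearly, for any integer $n$, $\tilde\mu([0, n)) < \infty$.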

The next step is to verify the conditions of Theorem 3.6 which will imply the Markov
chain associated with $\tilde T$ is locally $\tilde\mu$-recurrent. Since $\tilde\mu$ and $\tilde\nu$ are equivalent measures
we will also conclude that the chain is locally $\tilde\nu$-recurrent. In Appendix D it is shown
that $\tilde T([0, m] \mid \alpha)$ is continuous as a function of $\alpha$ implying condition (19) of Theorem
3.6. For $k$ a nonnegative integer set

$m_k(\alpha) = \int (\beta - \alpha)^k\, \tilde T(d\beta \mid \alpha)$.

Additional calculations given in Appendix D show that

(25) $\frac{m_3(\alpha)}{m_2(\alpha)} = O(1)$ as $\alpha \to \infty$

and

(26) $m_1(\alpha) = \frac{8\alpha + \psi_1(\alpha)}{\int (\beta + \alpha + c)\, \tilde R(d\beta \mid \alpha)}$

where $\psi_1(\alpha) = O(1)$ as $\alpha \to \infty$. Letting $\phi(\alpha) = 1/\sqrt{\alpha}$, note that $\lim_{\alpha \to \infty} \phi(\alpha) \log(\alpha) = 0$
and, by calculations in Appendix D,

(27) $\frac{m_2(\alpha)}{2\alpha}\, [1 + \phi(\alpha)] = \frac{8\alpha + 8\sqrt{\alpha} + \psi_2(\alpha)}{\int (\beta + \alpha + c)\, \tilde R(d\beta \mid \alpha)}$

where $\psi_2(\alpha) = O(1)$ as $\alpha \to \infty$. It is clear by inspection of (25), (26), and (27) that
the conditions (20) and (21) of Theorem 3.6 are satisfied for $\alpha$ large enough. Hence the
chain is locally $\tilde\mu$-recurrent.

By Theorem 3.5 the family of priors $\mathcal{F}_\nu$ is strongly admissible with respect to $\Phi_f$.

□



Appendix A. Preliminaries 

We begin by stating some existing results concerning local recurrence and
introduce Blyth's method. This material plays a fundamental role in our proofs of
Theorems 3.3, 3.4 and 3.5.

A.1. Local Recurrence. The purpose here is to give two characterizations of local
recurrence for general symmetric Markov chains, hence the notation is consistent with
that of Sections 2 and 3.2.1.

Let $\mathsf{W}$ be a Polish space and denote the Borel $\sigma$-algebra by $\mathcal{D}$. Let $K : \mathcal{D} \times \mathsf{W} \to [0, 1]$
be a Markov kernel on $(\mathsf{W}, \mathcal{D})$. Let $\xi$ be a non-trivial $\sigma$-finite measure and recall that
a set is $\xi$-proper if its measure under $\xi$ is positive and finite. Throughout this section
$K$ is assumed to be $\xi$-symmetric.



Theorem A.1 (Eaton [9]). The Markov chain $W$ is locally $\xi$-recurrent if and only if
there exists a sequence of $\xi$-proper sets increasing to the state space such that each is
locally $\xi$-recurrent.

Let $L^2(\xi)$ be the space of $\xi$-square integrable functions. Then the quantity

(28) $\Delta(h; K, \xi) = \frac{1}{2} \iint (h(\theta) - h(\eta))^2\, K(d\theta \mid \eta)\, \xi(d\eta), \quad h \in L^2(\xi)$

is called a Dirichlet form. Also, if $D$ is a $\xi$-proper set, define

$H_\xi(D) := \{h \in L^2(\xi) : h \ge I_D\}$

where $I_D$ is the indicator function of the set $D$. A characterization of local $\xi$-recurrence
in terms of $\Delta$ is given by the following result.

Theorem A.2 (Eaton [8]). A set $D \in \mathcal{D}$ is locally $\xi$-recurrent if and only if

$\inf_{h \in H_\xi(D)} \Delta(h; K, \xi) = 0$.
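
For intuition, the Dirichlet form (28) is easy to compute on a finite state space, where the double integral becomes a double sum. The Python sketch below is an illustration only; the function name `dirichlet_form` and the two-state kernel in the usage example are hypothetical and not taken from the text.

```python
def dirichlet_form(h, kernel, xi):
    """Finite-state version of (28):
    0.5 * sum_i sum_j (h[j] - h[i])^2 * kernel[i][j] * xi[i],
    where kernel[i][j] is the probability of moving from state i to state j.
    Hypothetical helper for illustration only."""
    n = len(h)
    return 0.5 * sum(
        (h[j] - h[i]) ** 2 * kernel[i][j] * xi[i]
        for i in range(n)
        for j in range(n)
    )


if __name__ == "__main__":
    kernel = [[0.5, 0.5], [0.5, 0.5]]  # symmetric with respect to xi below
    xi = [1.0, 1.0]
    print(dirichlet_form([1.0, 0.0], kernel, xi))  # indicator of state 0
    print(dirichlet_form([1.0, 1.0], kernel, xi))  # constant function
```

A constant function always has Dirichlet form zero, while the indicator of a single state picks up the probability flow across the boundary of that state.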

A.2. Blyth's Method. Consider the posterior distributions obtained from perturbations
of the prior measure $\nu$. Let $g : \vartheta \to \mathbb{R}_+$ be such that the perturbation

(29) $\nu_g(d\theta) = g(\theta)\, \nu(d\theta)$

is $\sigma$-finite. Assume

(30) $\hat g(x) := \int g(\theta)\, Q(d\theta \mid x) < \infty$.

Letting

$M_g(dx) := \int P(dx \mid \theta)\, g(\theta)\, \nu(d\theta)$

it is easy to see that $M_g(dx) = \hat g(x)\, M(dx)$ is $\sigma$-finite and that the posterior obtained
from the perturbed prior is

(31) $Q_g(d\theta \mid x) = \frac{g(\theta)}{\hat g(x)}\, Q(d\theta \mid x)$.

Let $L^1(\nu)$ be the set of all $\nu$-integrable functions and define

(32) $\mathcal{G}_\nu = \{g \in L^1(\nu) : g \ge 0, \ g \text{ bounded and } \nu_g(\vartheta) > 0\}$.

Let $I_C$ be the indicator function of the set $C$. Of particular interest are the subfamilies

$\mathcal{G}_\nu(C) = \{g \in \mathcal{G}_\nu : g \ge I_C\}$.

Let $\hat\varphi$ be the Bayes estimator of $\varphi$ under the prior $\nu$, and let $\hat\varphi_g$ be the Bayes estimator
of $\varphi$ under the prior $\nu_g$. Also recall the definition of the risk function $R$ at (4). A key
quantity in connecting Markov chains to admissibility is the integrated risk difference
at $g \in \mathcal{G}_\nu$ with respect to $\nu$ against $\hat\varphi$:

(33) $\mathrm{IRD}_\nu(g; \hat\varphi) := \int \left[ R(\hat\varphi; \theta) - R(\hat\varphi_g; \theta) \right] g(\theta)\, \nu(d\theta)$.

Notice that the integrability assumptions on $R$ in Assumption 3.1 ensure that $\mathrm{IRD}_\nu$ is
well-defined. The connection of $\mathrm{IRD}_\nu$ with almost-$\nu$ admissibility is given by Blyth's
method.

Theorem A.3 (Blyth's Method). Let $\hat\varphi$ be an estimator and $\nu$ a $\sigma$-finite measure on the
parameter space. If $\inf_{g \in \mathcal{G}_\nu(C)} \mathrm{IRD}_\nu(g; \hat\varphi) = 0$ for every $C \subseteq \vartheta$ such that $0 < \nu(C) < \infty$,
then $\hat\varphi$ is almost-$\nu$ admissible.

Appendix B. Proof of Theorem 3.3

We will develop a connection between the kernel $T$ defined in (11) and the $\mathrm{IRD}_\nu$
which will be key to proving Theorem 3.3 via Theorems A.2 and A.3. We begin with
some preliminary results before we prove Theorem 3.3.

B.1. Preliminary Results. Recall $\mathcal{T}(\eta) = \psi(\vartheta \mid \eta)$ where $\psi$ is defined at (10). Define

(34) $\mu(C) = \int_C \mathcal{T}(\eta)\, \nu(d\eta)$ for all $C \in \mathcal{C}$.
Proposition B.l. Suppose Assumption s.^ and condition (fl2|) hold. Then 
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Proof. That /i is er-finite follows easily from Assumption 13.21 Since T is a nonnegative 
measurable function, for any z/-null set C, 



Furthermore, since T(rj) > for all r\ in the parameter space, every iz-positive set is a 
//-positive set. Thus, if C is /i-null, it is also zy-null. Therefore, \i and v are equivalent 
measures. Furthermore, T is a Radon-Nikodym derivative of \i with respect to v since 
\x is absolutely continuous with respect to u, and by assumption T is a nonnegative 
measurable function such that for any measurable set C, fi(C) is given by equation 



At any point 77 in the parameter space, since is a nontrivial finite measure, 

normalizing by T(t]) = ip("&\T]) produces a probability measure. Recall that ip(C\-) is 
a nonnegative measurable function where C is an element of the Borel sets C. Since 
i? is a Borel measurable set, T is a Borel measurable function. Also by hypothesis, 
7~(t)) is positive and finite for all 77. The reciprocal function is continuous, hence Borel 
measurable, on the positive real numbers, and the composition 1/7" of Borel measur- 
able functions is a Borel measurable function. Thus, the product of Borel measurable 
functions 1/7" and ip(C\-) is a Borel measurable function for any C G C. 

Let A and B be measurable sets. Since 




dMD. 




I A (v)lB(0)f(0,v)R(d6\v)v(dv) by (HDD, (ffU), and flM) 




by P and ([12]) 




5(5, A) 



the kernel T is //-symmetric. Note that the substitution 



(35) 



T(d9\r]) n(dr)) = f(6, rj) R(d8\rj) v(drj) 



holds as a consequence. 
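Let $\mu' = c\mu$ for some $c > 0$. Then by (12) and (35)

$$\begin{aligned}
T(d\theta \mid \eta)\,\mu'(d\eta) &= c\,T(d\theta \mid \eta)\,\mu(d\eta) \\
&= c\,f(\theta,\eta)\,R(d\theta \mid \eta)\,\nu(d\eta) \\
&= c\,f(\eta,\theta)\,R(d\eta \mid \theta)\,\nu(d\theta) \\
&= c\,T(d\eta \mid \theta)\,\mu(d\theta) \\
&= T(d\eta \mid \theta)\,\mu'(d\theta)
\end{aligned}$$

which implies $T$ is $\mu'$-symmetric. □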
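The symmetry argument can be illustrated on a finite state space, where every integral becomes a sum. The sketch below is only an illustration under stated assumptions: the state space, the symmetric weight matrix `W` used to manufacture a $\nu$-symmetric kernel, and the helper names are all hypothetical and do not come from the paper. It builds $T = fR/\mathcal{T}$ and $\mu = \mathcal{T}\nu$ and measures how far the detailed balance relation is from holding.

```python
import random

def build_generalized_chain(W, f):
    """Finite-state sketch. W: symmetric nonnegative weights, f: symmetric positive function.

    Index i plays the role of eta (current state) and j the role of theta.
    """
    n = len(W)
    nu = [sum(W[i]) for i in range(n)]                            # measure that makes R symmetric
    R = [[W[i][j] / nu[i] for j in range(n)] for i in range(n)]   # R[i][j] = R(theta_j | eta_i)
    tau = [sum(f[i][j] * R[i][j] for j in range(n)) for i in range(n)]   # normalizing function
    T = [[f[i][j] * R[i][j] / tau[i] for j in range(n)] for i in range(n)]
    mu = [tau[i] * nu[i] for i in range(n)]                       # mu(d eta) = tau(eta) nu(d eta)
    return nu, R, tau, T, mu

def max_asymmetry(measure, kernel):
    """Largest violation of measure[i] * kernel[i][j] == measure[j] * kernel[j][i]."""
    n = len(measure)
    return max(abs(measure[i] * kernel[i][j] - measure[j] * kernel[j][i])
               for i in range(n) for j in range(n))

random.seed(0)
n = 5
W = [[0.0] * n for _ in range(n)]
f = [[0.0] * n for _ in range(n)]
for i in range(n):
    for j in range(i, n):
        W[i][j] = W[j][i] = random.uniform(0.1, 1.0)
        f[i][j] = f[j][i] = random.uniform(0.5, 2.0)

nu, R, tau, T, mu = build_generalized_chain(W, f)
print("asymmetry of R w.r.t. nu:", max_asymmetry(nu, R))
print("asymmetry of T w.r.t. mu:", max_asymmetry(mu, T))
print("asymmetry of T w.r.t. 3*mu:", max_asymmetry([3.0 * m for m in mu], T))
```

In this toy setting $\mu_i T_{ij} = f_{ij} W_{ij}$, which is symmetric in $(i, j)$, so the reported asymmetries should vanish up to rounding for $\mu$ and for any positive multiple of $\mu$.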

We can now develop a connection between the Markov kernel $T$ and the integrated risk difference $\mathrm{IRD}_\nu$. Our argument will require the following known result; recall the definition of $\mathcal{G}_\nu$ from (32).

Proposition B.2 (Eaton, 2001). If $g \in \mathcal{G}_\nu$ and $R$ is Eaton's kernel, then

$$\mathrm{IRD}_\nu(g; \hat\varphi) \le \iint \|\varphi(\theta) - \varphi(\eta)\|^2 \left(\sqrt{g(\theta)} - \sqrt{g(\eta)}\right)^2 R(d\theta \mid \eta)\,\nu(d\eta) .$$



By Proposition B.1 we see that $T$ is $\mu$-symmetric so that the relevant Dirichlet form, recall (28), is

$$\Delta(h; T, \mu) = \frac{1}{2} \iint \left(h(\eta) - h(\theta)\right)^2 T(d\theta \mid \eta)\,\mu(d\eta), \qquad h \in L^2(\mu) .$$

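Proposition B.3. Suppose Assumption 3.2 and condition (12) hold. If $\varphi \in \Phi_f$ and $g \in \mathcal{G}_\nu$, then there is a measure $\mu_\varphi$ which is proportional to $\mu$ such that

$$\mathrm{IRD}_\nu(g; \hat\varphi) \le 2\,\Delta(\sqrt{g}; T, \mu_\varphi) .$$

Proof. Since $\varphi \in \Phi_f$, there exists $0 < M_\varphi < \infty$ such that

$$\|\varphi(\theta) - \varphi(\eta)\|^2 \le M_\varphi\, f(\theta,\eta) \quad \text{for all } \theta, \eta .$$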

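For $C \in \mathcal{C}$ define $\mu_\varphi(C) = M_\varphi\,\mu(C)$ and suppose $g \in \mathcal{G}_\nu$. In the following the first inequality is from Proposition B.2 and the second is obtained from Proposition B.1(d) and the fact that $\varphi \in \Phi_f$:

$$\begin{aligned}
\mathrm{IRD}_\nu(g; \hat\varphi) &\le \iint \|\varphi(\theta) - \varphi(\eta)\|^2 \left(\sqrt{g(\theta)} - \sqrt{g(\eta)}\right)^2 R(d\theta \mid \eta)\,\nu(d\eta) \\
&\le \iint \left(\sqrt{g(\theta)} - \sqrt{g(\eta)}\right)^2 T(d\theta \mid \eta)\, M_\varphi\,\mu(d\eta) \\
&= 2\,\Delta(\sqrt{g}; T, \mu_\varphi) .
\end{aligned}$$
□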
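The second inequality in this proof can be checked numerically on a finite state space. The following is a minimal sketch, not the paper's construction: the state space, the weight matrix `W`, the function values `phi`, and the particular choice $f(\theta,\eta) = (\varphi(\theta) - \varphi(\eta))^2 + 1$ (so that the domination condition holds with $M_\varphi = 1$) are all assumptions made for illustration.

```python
import math
import random

def dirichlet_form(h, T, mu):
    """Delta(h; T, mu) = (1/2) * sum_{i,j} (h[i] - h[j])^2 * T[i][j] * mu[i]."""
    n = len(h)
    return 0.5 * sum((h[i] - h[j]) ** 2 * T[i][j] * mu[i]
                     for i in range(n) for j in range(n))

def example(seed=1, n=6):
    rng = random.Random(seed)
    # Symmetric weights give a kernel R that is symmetric with respect to nu.
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            W[i][j] = W[j][i] = rng.uniform(0.1, 1.0)
    nu = [sum(row) for row in W]
    R = [[W[i][j] / nu[i] for j in range(n)] for i in range(n)]

    # Hypothetical scalar function phi and a symmetric f dominating its squared increments.
    phi = [rng.uniform(-2.0, 2.0) for _ in range(n)]
    M = 1.0
    f = [[(phi[i] - phi[j]) ** 2 + 1.0 for j in range(n)] for i in range(n)]

    tau = [sum(f[i][j] * R[i][j] for j in range(n)) for i in range(n)]
    T = [[f[i][j] * R[i][j] / tau[i] for j in range(n)] for i in range(n)]
    mu_phi = [M * tau[i] * nu[i] for i in range(n)]

    sqrt_g = [math.sqrt(rng.uniform(0.0, 1.0)) for _ in range(n)]
    # Right-hand side of the bound in Proposition B.2 (finite-state analogue).
    lhs = sum((phi[i] - phi[j]) ** 2 * (sqrt_g[i] - sqrt_g[j]) ** 2 * R[i][j] * nu[i]
              for i in range(n) for j in range(n))
    rhs = 2.0 * dirichlet_form(sqrt_g, T, mu_phi)
    return lhs, rhs

lhs, rhs = example()
print(lhs, "<=", rhs)
```

The comparison works termwise: $T_{ij}\,\mu_{\varphi,i} = M f_{ij} R_{ij} \nu_i$, and each squared increment of `phi` is at most $M f_{ij}$ by construction.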

B.2. Proof of Theorem 3.3. Let the measures $\psi(\cdot \mid \eta)$ and $\mu$ and the transition kernel $T$ be as defined earlier, with $\mu$ given at (34). Also, suppose $\varphi \in \Phi_f$. By Proposition B.3, if $g \in \mathcal{G}_\nu$, then

$$\mathrm{IRD}_\nu(g; \hat\varphi) \le 2\,\Delta(\sqrt{g}; T, \mu_\varphi)$$

where $\mu_\varphi = M_\varphi\,\mu$. By Proposition B.1 we have that $\mu$ and $\nu$ are equivalent measures and $T$ is a $\mu_\varphi$-symmetric Markov kernel. Since the measures $\mu_\varphi$ and $\nu$ are equivalent and the chain with kernel $T$ is locally $\nu$-recurrent, it is also locally $\mu_\varphi$-recurrent; that is, every $\mu_\varphi$-proper set $C$ is locally recurrent. Thus, by Theorem A.2

$$\inf_{h \in \mathcal{H}_{\mu_\varphi}(C)} \Delta(h; T, \mu_\varphi) = 0$$

where $\mathcal{H}_{\mu_\varphi}(C)$ collects the square-integrable dominators of $I_C$.

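Now let $h \in \mathcal{H}_{\mu_\varphi}(C)$ and recall that by assumption $\mathcal{T}$ is a measurable function uniformly bounded away from zero $\nu$-almost everywhere. Hence there exists some $\varepsilon > 0$ such that

$$\int h^2(\eta)\,\nu(d\eta) \le \frac{1}{\varepsilon} \int h^2(\eta)\,\mathcal{T}(\eta)\,\nu(d\eta) = \frac{1}{\varepsilon} \int h^2(\eta)\,\mu(d\eta) < \infty .$$

Hence $h \in L^2(\nu)$ and we conclude that $\mathcal{H}_{\mu_\varphi}(C) \subset \mathcal{G}_\nu(C)$. Moreover, if $\sqrt{h} \in \mathcal{H}_{\mu_\varphi}(C)$, then $h \in \mathcal{G}_\nu(C)$. Thus we obtain

$$\inf_{g \in \mathcal{G}_\nu(C)} \mathrm{IRD}_\nu(g; \hat\varphi) = 0 .$$

Therefore, $\hat\varphi$ is an almost-$\nu$ admissible estimator by Theorem A.3.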

B.3. Proof of Theorem 3.4. Recall that $\nu_u \in \mathcal{F}_\nu$ is a bounded perturbation of $\nu$. The sampling and posterior distributions define the $\nu_u$-symmetric Eaton kernel

$$R_u(d\theta \mid \eta) = \int_{\mathcal{X}} Q_u(d\theta \mid x)\, P(dx \mid \eta)$$

where the perturbed posterior was defined at (31). The mean of $\varphi$ with respect to $Q_u(\cdot \mid x)$ is the formal Bayes estimator of $\varphi(\theta)$ under squared error loss. We denote it $\hat\varphi_u$ to emphasize its dependence on the perturbed prior.

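Since $\nu_u \in \mathcal{F}_\nu$, there exists $0 < c < \infty$ such that $\nu_u(d\theta) = u(\theta)\,\nu(d\theta)$ and $1/c < u(\theta) < c$. Recall the definition of $\mu_\varphi$ from the proof of Proposition B.3. In the following the first inequality is from Proposition B.2, the second follows by noting that $u$ and $1/u$ are both bounded above by $c$, and the third is obtained from Proposition B.1(d) and the fact that $\varphi \in \Phi_f$:

$$\begin{aligned}
\mathrm{IRD}_{\nu_u}(g; \hat\varphi_u) &\le \iint \|\varphi(\theta) - \varphi(\eta)\|^2 \left(\sqrt{g(\theta)} - \sqrt{g(\eta)}\right)^2 R_u(d\theta \mid \eta)\,\nu_u(d\eta) \\
&\le c^3 \iint \|\varphi(\theta) - \varphi(\eta)\|^2 \left(\sqrt{g(\theta)} - \sqrt{g(\eta)}\right)^2 R(d\theta \mid \eta)\,\nu(d\eta) \\
&\le c^3 \iint \left(\sqrt{g(\theta)} - \sqrt{g(\eta)}\right)^2 T(d\theta \mid \eta)\, M_\varphi\,\mu(d\eta) \\
&= 2c^3\,\Delta(\sqrt{g}; T, \mu_\varphi) .
\end{aligned}$$

The remainder of the proof follows the proof of Theorem 3.3 exactly.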
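One way to see where the factor $c^3$ can come from is a finite toy model, sketched below under assumptions that are not in the paper: a finite parameter set, a finite sample space, and randomly generated `nu`, `u`, and likelihood `p`. The perturbed prior contributes one factor of $c$, while the perturbed posterior contributes at most $c^2$ (the numerator gains at most $c$ and the perturbed marginal is at least $1/c$ times the original one).

```python
import random

random.seed(2)
n_theta, n_x, c = 4, 5, 3.0

nu = [random.uniform(0.5, 2.0) for _ in range(n_theta)]       # toy prior masses
u = [random.uniform(1.0 / c, c) for _ in range(n_theta)]      # bounded perturbation
nu_u = [u[t] * nu[t] for t in range(n_theta)]

# p[t][x] = P(x | theta_t); each row is normalized to a probability vector.
p = []
for _ in range(n_theta):
    row = [random.uniform(0.1, 1.0) for _ in range(n_x)]
    s = sum(row)
    p.append([v / s for v in row])

def eaton_kernel(prior):
    """R[e][t] = sum_x Q(theta_t | x) P(x | theta_e), with Q the formal posterior under `prior`."""
    marginal = [sum(p[t][x] * prior[t] for t in range(n_theta)) for x in range(n_x)]
    Q = [[p[t][x] * prior[t] / marginal[x] for t in range(n_theta)] for x in range(n_x)]
    return [[sum(Q[x][t] * p[e][x] for x in range(n_x)) for t in range(n_theta)]
            for e in range(n_theta)]

R = eaton_kernel(nu)
R_u = eaton_kernel(nu_u)

# Largest ratio of R_u(theta | eta) nu_u(eta) to R(theta | eta) nu(eta).
worst_ratio = max(R_u[e][t] * nu_u[e] / (R[e][t] * nu[e])
                  for e in range(n_theta) for t in range(n_theta))
print("worst ratio:", worst_ratio, " bound c^3:", c ** 3)
```

The printed ratio should not exceed $c^3$; in typical draws it is well below the bound, since the three factors are rarely extreme simultaneously.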

Appendix C. Proof of Theorem 3.5

Before proving Theorem 3.5 we require an analogue of Proposition B.1. Given a nonnegative function $\tilde h$ on $[0, \infty)$, define

$$h(\theta) = \tilde h(t(\theta)) . \tag{36}$$

Proposition C.l. Suppose Assumptions \3.c\ and \3.4 hold. If f((3,a) = f(a,(3), then 

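(a) $\tilde\mu$ is $\sigma$-finite and equivalent to $\tilde\nu$,

(b) $\tilde{\mathcal{T}}$ is a Radon-Nikodym derivative of $\tilde\mu$ with respect to $\tilde\nu$,

(c) $\tilde T$ is a $\tilde\mu$-symmetric Markov transition kernel,

(d) $\tilde T(d\beta \mid \alpha)\,\tilde\mu(d\alpha) = f(\beta,\alpha)\,\tilde R(d\beta \mid \alpha)\,\tilde\nu(d\alpha)$,

(e) $\tilde h \in L^2(\tilde\mu)$ implies $h$ is in $L^2(\mu)$,

(f) $\Delta(\tilde h; \tilde T, \tilde\mu) = \Delta(h; T, \mu)$, and

(g) $\mathcal{T}(\eta)$ is almost-$\nu$ bounded away from zero.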

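Proof. The proof of the first three assertions follows exactly the proof of the first three assertions in Proposition B.1 with $[0, \infty)$, $\mathcal{A}$, $\alpha$, $\beta$, $\tilde\nu$, and $\tilde R$ substituted for $\vartheta$, $\mathcal{C}$, $\eta$, $\theta$, $\nu$, and $R$, respectively. The substitution

$$\tilde T(d\beta \mid \alpha)\,\tilde\mu(d\alpha) = f(\beta,\alpha)\,\tilde R(d\beta \mid \alpha)\,\tilde\nu(d\alpha) \tag{37}$$

follows as a consequence. If $\tilde h \in L^2(\tilde\mu)$, then $h \in L^2(\mu)$ since

$$\begin{aligned}
\int \tilde h^2(\alpha)\,\tilde\mu(d\alpha) &= \int \tilde h^2(\alpha)\,\tilde{\mathcal{T}}(\alpha)\,\tilde\nu(d\alpha) \\
&= \int \tilde h^2(\alpha) \int f(\beta,\alpha) \int Q(d\beta \mid x)\,P(dx \mid \alpha)\,\tilde\nu(d\alpha) \\
&= \int h^2(\eta) \int f(t(\theta),t(\eta)) \int Q(d\theta \mid x)\,P(dx \mid \eta)\,\nu(d\eta) \\
&= \int h^2(\eta)\,\mathcal{T}(\eta)\,\nu(d\eta) \\
&= \int h^2(\eta)\,\mu(d\eta) .
\end{aligned}$$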
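Since

$$\begin{aligned}
\iint \big(\tilde h(\beta) - \tilde h(\alpha)\big)^2\, \tilde T(d\beta \mid \alpha)\,\tilde\mu(d\alpha)
&= \iint \big(\tilde h(\beta) - \tilde h(\alpha)\big)^2 f(\beta,\alpha)\,\tilde R(d\beta \mid \alpha)\,\tilde\nu(d\alpha) && \text{by (37)} \\
&= \iint \big(\tilde h(\beta) - \tilde h(\alpha)\big)^2 f(\beta,\alpha) \int Q(d\beta \mid x)\,P(dx \mid \alpha)\,\tilde\nu(d\alpha) && \text{by the definition of } \tilde R \\
&= \iint \big(h(\theta) - h(\eta)\big)^2 f(t(\theta),t(\eta)) \int Q(d\theta \mid x)\,P(dx \mid \eta)\,\nu(d\eta) && \text{by (36) and the change of variables} \\
&= \iint \big(h(\theta) - h(\eta)\big)^2 f(t(\theta),t(\eta))\,R(d\theta \mid \eta)\,\nu(d\eta) && \text{by the definition of } R \\
&= \iint \big(h(\theta) - h(\eta)\big)^2\, T(d\theta \mid \eta)\,\mu(d\eta) && \text{by (35),}
\end{aligned}$$

it follows that $\Delta(\tilde h; \tilde T, \tilde\mu) = \Delta(h; T, \mu)$. Finally, recall that $\tilde{\mathcal{T}}$ is almost-$\tilde\nu$ bounded away from zero by some positive constant $c$. Note that for any $\tilde\nu$-proper set $A$,

$$\int_{t^{-1}(A)} \mathcal{T}(\eta)\,\nu(d\eta) = \int_A \tilde{\mathcal{T}}(\alpha)\,\tilde\nu(d\alpha) \ge c\,\tilde\nu(A) = c\,\nu(t^{-1}(A)) .$$

It follows that $\mathcal{T}$ is almost-$\nu$ bounded away from zero. □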

We are now ready to prove Theorem 3.5.

C.1. Proof of Theorem 3.5. By Proposition C.1 the measures $\tilde\mu$ and $\tilde\nu$ are equivalent, and the kernel $\tilde T$ is $\tilde\mu$-symmetric. Thus, the chain with kernel $\tilde T$ is locally $\tilde\mu$-recurrent, and by Theorem A.2

$$\inf_{\tilde h \in \mathcal{H}_{\tilde\mu}(A)} \Delta(\tilde h; \tilde T, \tilde\mu) = 0$$

for any $\tilde\mu$-proper set $A$.

Let $(A_i)$ be a sequence of $\tilde\mu$-proper sets increasing to $[0, \infty)$. Letting $C_i = t^{-1}(A_i)$ defines a sequence of $\mu$-proper sets increasing to $\vartheta$. Since $h(\theta) = \tilde h(t(\theta))$, $\tilde h \ge I_{A_i}$ implies that $h \ge I_{C_i}$. By Proposition C.1, $\tilde h \in L^2(\tilde\mu)$ implies $h \in L^2(\mu)$ and the corresponding Dirichlet forms are equal. Thus,

$$\inf_{h \in \mathcal{H}_\mu(C_i)} \Delta(h; T, \mu) = \inf_{\tilde h \in \mathcal{H}_{\tilde\mu}(A_i)} \Delta(\tilde h; \tilde T, \tilde\mu) = 0$$

for every $C_i$ in the sequence. By Theorem A.1 the chain with kernel $T$ is locally $\mu$-recurrent. Since $\mu$ and $\nu$ are equivalent measures, the chain is locally $\nu$-recurrent as well.

By Proposition C.1, $\mathcal{T}$ is almost-$\nu$ bounded away from zero, and thus all of the conditions for Theorem 3.4 are satisfied. Therefore, under squared error loss, the family $\mathcal{F}_\nu$ is $\Phi_f$-admissible.

Appendix D. Multivariate Normal

Here we give the supplemental arguments required for the proof of Theorem 4.1.

D.1. Existence of the integrated risk difference. In order to appeal to Blyth's method, we need to know that the integrated risk differences are defined. Otherwise the bounding inequality (B.2) for the integrated risk differences is meaningless. The proper Bayes estimators necessarily have finite integrated risks, so it is sufficient to show that

$$\int R(\theta; \hat\theta)\, g_n(\theta)\,\nu(d\theta) < \infty$$

where $g_n$ is the indicator of $C_n$, a closed ball around zero with radius $n$. Since $\nu$ is $\sigma$-finite, $\nu(C_n)$ is finite. Since the risk function is real-valued and continuous, it attains a finite maximum on $C_n$; call it $M_n$. Therefore,

$$\int R(\theta; \hat\theta)\, g_n(\theta)\,\nu(d\theta) \le M_n\,\nu(C_n) < \infty$$

and the integrated risk differences are defined.

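D.2. Continuity of the transition kernel. Let $m$ be a nonnegative integer. We wish to show that $\tilde T([0, m] \mid \alpha)$ is continuous as a function of $\alpha$. Since by definition

$$\tilde T([0,m] \mid \alpha) = \frac{\int_0^m \beta\,\tilde R(d\beta \mid \alpha) + \alpha\,\tilde R([0,m] \mid \alpha) + c\,\tilde R([0,m] \mid \alpha)}{\int \beta\,\tilde R(d\beta \mid \alpha) + \alpha + c}$$

it is sufficient to show that $\tilde R([0, m] \mid \alpha)$ is continuous and that continuity of

$$\int_0^m \beta\,\tilde R(d\beta \mid \alpha) \quad \text{and} \quad \int \beta\,\tilde R(d\beta \mid \alpha)$$

follows from there.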

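Fix $\alpha^* \in [0, \infty)$ and $\delta$ greater than zero. Let $S_\delta(\alpha^*) = [0, (\sqrt{\alpha^*} + \delta)^2)$. Let $(\alpha_n)$ be a sequence with limit $\alpha^*$ whose elements are in $S_\delta(\alpha^*)$. Define

$$A_\delta(\alpha^*) = \{ x : \|x\| < 2(\sqrt{\alpha^*} + \delta) \},$$

an open ball in $\mathcal{X}$ with radius greater than twice $\sqrt{\alpha^*}$. Let $S$ denote the surface of the unit hypersphere in $\mathbb{R}^p$. For any $x \in A_\delta^c(\alpha^*)$, that is, any point such that $\|x\|$ is at least $2(\sqrt{\alpha^*} + \delta)$, and any $\xi$ on the unit hypersphere,

$$\|x - \xi\sqrt{\alpha_n}\| \ge \left\| x \left(1 - \sqrt{\alpha_n}/\|x\|\right) \right\| \ge \|x/2\|$$

since $x$ cannot be closer to any point with radius $\sqrt{\alpha_n}$ than it is to $x\sqrt{\alpha_n}/\|x\|$, the point with radius $\sqrt{\alpha_n}$ on the common ray, and since $\sqrt{\alpha_n}$ is less than $\sqrt{\alpha^*} + \delta$ by construction. Recall that

$$p(x \mid \alpha) = \int_S (2\pi)^{-p/2}\, e^{-\frac{1}{2}\|x - \xi\sqrt{\alpha}\|^2}\,\pi(d\xi)$$

where $\pi$ is the uniform distribution on $S$. Let

$$g_1(x) = (2\pi)^{-p/2}\, I_{A_\delta(\alpha^*)}(x) + I_{A_\delta^c(\alpha^*)}(x)\, e^{-\frac{1}{2}\|x/2\|^2} .$$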
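The geometric bound behind the dominating function, namely that the distance from $x$ to any point of radius $\sqrt{\alpha_n}$ is at least $\|x\|/2$ whenever $x$ lies outside the ball of radius $2(\sqrt{\alpha^*}+\delta)$, can be spot-checked by simulation. The sketch below is illustrative only; the dimension, the values of $\alpha^*$ and $\delta$, and the sampling scheme are arbitrary choices, not taken from the paper.

```python
import math
import random

def random_unit_vector(p, rng):
    """Uniform direction on the unit sphere in R^p via normalized Gaussians."""
    z = [rng.gauss(0.0, 1.0) for _ in range(p)]
    norm = math.sqrt(sum(v * v for v in z))
    return [v / norm for v in z]

def smallest_slack(p, alpha_star, delta, trials, rng):
    """Minimum over random draws of ||x - xi*sqrt(alpha_n)|| - ||x||/2.

    x is drawn outside the ball of radius 2*(sqrt(alpha_star) + delta) and
    alpha_n is drawn from [0, (sqrt(alpha_star) + delta)^2].
    """
    half_radius = math.sqrt(alpha_star) + delta
    worst = float("inf")
    for _ in range(trials):
        alpha_n = rng.uniform(0.0, half_radius ** 2)
        xi = random_unit_vector(p, rng)
        r = 2.0 * half_radius * (1.0 + rng.expovariate(1.0))   # ||x|| >= 2*(sqrt(alpha*) + delta)
        direction = random_unit_vector(p, rng)
        x = [r * v for v in direction]
        root = math.sqrt(alpha_n)
        dist = math.sqrt(sum((x[i] - root * xi[i]) ** 2 for i in range(p)))
        worst = min(worst, dist - r / 2.0)
    return worst

slack = smallest_slack(p=3, alpha_star=2.0, delta=0.5, trials=2000, rng=random.Random(0))
print("smallest observed slack:", slack)
```

A nonnegative slack is consistent with the inequality, which in turn gives $e^{-\frac{1}{2}\|x - \xi\sqrt{\alpha_n}\|^2} \le e^{-\frac{1}{2}\|x/2\|^2}$ outside the ball.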
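Since $\int g_1(x)\,\pi(d\xi) < \infty$ and $g_1(x)$ dominates the integrand of $p(x \mid \alpha_n)$, we can say that

$$\lim_{n \to \infty} p(x \mid \alpha_n) = p(x \mid \alpha^*) .$$

Furthermore, $\int g_1(x)\,dx < \infty$ and $g_1(x)$ dominates $p(x \mid \alpha_n)$ by monotonicity of the integral. Therefore,

$$\lim_{n \to \infty} \int p(x \mid \alpha_n)\,dx = \int \lim_{n \to \infty} p(x \mid \alpha_n)\,dx = \int p(x \mid \alpha^*)\,dx .$$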

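Denote the Borel $\sigma$-algebra on $[0, \infty)$ by $\mathcal{B}$, and choose $B \in \mathcal{B}$. Since the densities $p$ and $q$ are necessarily nonnegative and measurable, Fubini's theorem gives

$$\tilde R(B \mid \alpha) = \int_B \int_{\mathcal{X}} Q(d\beta \mid x)\,P(dx \mid \alpha) = \int_{\mathcal{X}} p(x \mid \alpha)\,Q(B \mid x)\,dx .$$

Similarly,

$$\int_B \beta\,\tilde R(d\beta \mid \alpha) = \int_B \beta \int_{\mathcal{X}} Q(d\beta \mid x)\,P(dx \mid \alpha) = \int_{\mathcal{X}} p(x \mid \alpha) \int_B \beta\,q(\beta \mid x)\,d\beta\,dx .$$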
Let

$$f(x) = p(x \mid \alpha^*)\,Q(B \mid x) \quad \text{and} \quad f_n(x) = p(x \mid \alpha_n)\,Q(B \mid x)$$

so that

$$\lim_{n \to \infty} f_n(x) = f(x) .$$

Since $Q(B \mid x)$ is a probability,

$$f_n(x) \le p(x \mid \alpha_n) \le g_1(x) .$$

Hence, by the dominated convergence theorem,

$$\lim_{n \to \infty} \int f_n(x)\,dx = \int \lim_{n \to \infty} f_n(x)\,dx = \int f(x)\,dx .$$

That is,

$$\lim_{n \to \infty} \tilde R(B \mid \alpha_n) = \tilde R(B \mid \alpha^*) .$$

Therefore, $\tilde R(B \mid \alpha)$ is a continuous function of $\alpha$.
Now note that

$$\int_B \beta\,q(\beta \mid x)\,d\beta \le \int \beta\,q(\beta \mid x)\,d\beta .$$

By Proposition A.7 of Eaton et alia [11], for some bounded $\psi(\|x\|^2)$,

$$\int \beta\,q(\beta \mid x)\,d\beta = -p + \|x\|^2 + \psi(\|x\|^2)$$

so that there exists a constant $k > 0$ such that

$$\int \beta\,q(\beta \mid x)\,d\beta \le k + \|x\|^2 .$$

Let $g_2(x) = k + \|x\|^2$, and let $g(x) = g_1(x)\,g_2(x)$. Let

$$f_n(x) = p(x \mid \alpha_n) \int_B \beta\,q(\beta \mid x)\,d\beta \quad \text{and} \quad f(x) = p(x \mid \alpha^*) \int_B \beta\,q(\beta \mid x)\,d\beta .$$

Note that

$$\lim_{n \to \infty} f_n(x) = f(x)$$

and $g(x)$ dominates $f_n(x)$. One can show that

$$\int g(x)\,dx < \infty$$

by expanding the product $g_1(x)\,g_2(x)$ and integrating the components. Now, by the dominated convergence theorem,

$$\lim_{n \to \infty} \int f_n(x)\,dx = \int \lim_{n \to \infty} f_n(x)\,dx = \int f(x)\,dx .$$

That is,

$$\lim_{n \to \infty} \int_B \beta\,\tilde R(d\beta \mid \alpha_n) = \int_B \beta\,\tilde R(d\beta \mid \alpha^*) .$$

Therefore, $\int_B \beta\,\tilde R(d\beta \mid \alpha)$ is continuous as a function of $\alpha$. Since $B$ is any Borel set, the continuity holds for both $[0, m]$ and $[0, \infty)$. Finally, since $\tilde T([0, m] \mid \alpha)$ is an algebraic combination of continuous functions, it is itself a continuous function.



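D.3. Moment conditions. We are interested in moments of the transition kernel about the current state:

$$m_k(\alpha) = \int (\beta - \alpha)^k\,\tilde T(d\beta \mid \alpha) .$$

We can express these moments in terms of $\tilde R$ as

$$m_k(\alpha) = \frac{\int (\beta - \alpha)^k (\beta + \alpha)\,\tilde R(d\beta \mid \alpha) + c \int (\beta - \alpha)^k\,\tilde R(d\beta \mid \alpha)}{\int (\beta + \alpha + c)\,\tilde R(d\beta \mid \alpha)} .$$

We know from Appendix A of Eaton et alia [11] that

$$\int \beta\,\tilde R(d\beta \mid \alpha) = \alpha + \phi_1(\alpha), \tag{38}$$

$$\int \beta^2\,\tilde R(d\beta \mid \alpha) = \alpha^2 + 8\alpha + \phi_2(\alpha), \text{ and} \tag{39}$$

$$\int \beta^3\,\tilde R(d\beta \mid \alpha) = \alpha^3 + 24\alpha^2 + \phi_3(\alpha), \tag{40}$$

where, as $\alpha \to \infty$, $\phi_1(\alpha) = O(\alpha^{-1})$, $\phi_2(\alpha) = O(1)$, and $\phi_3(\alpha) = O(\alpha)$. A similar argument shows that

$$\int \beta^4\,\tilde R(d\beta \mid \alpha) = \alpha^4 + 48\alpha^3 + \phi_4(\alpha) \tag{41}$$

where $\phi_4(\alpha) = O(\alpha^2)$ as $\alpha \to \infty$.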

Let $g(z) = (a + z)^{-p/2}$, and let $\tau_k(y) = E[g(U)U^k \mid y]$ with $U \sim \chi^2_p(y)$. Proposition A.2 of Eaton et alia [11] establishes that if $Y \sim \chi^2_p(\alpha)$, then

$$\int \beta^k\,\tilde R(d\beta \mid \alpha) = E\left[ \frac{\tau_k(Y)}{\tau_0(Y)} \,\Big|\, \alpha \right] < \infty .$$

Let

$$w_k(n) = \int_0^\infty g(z)\, \frac{(z/2)^{n + p/2 + k - 1}\, e^{-z/2}}{2\,\Gamma(n + p/2)}\,dz$$

and note that $E[g(U)U^k \mid y] = 2^k E[w_k(N) \mid y]$ where $N \mid y \sim \text{Poisson}(y/2)$. This last equality follows from expressing $U$ as a Poisson mixture of $\chi^2$ random variables. From the definition of $\tau_k$ and $w_k$, we have that $\tau_k(y) = 2^k E[w_k(N) \mid y]$. Therefore,

$$\int \beta^k\,\tilde R(d\beta \mid \alpha) = 2^k E\left[ \frac{E(w_k(N) \mid Y)}{E(w_0(N) \mid Y)} \,\Big|\, \alpha \right] .$$

Proposition A.7 of Eaton et alia [11] establishes that

$$\frac{E[w_k(N) \mid y]}{E[w_{k-1}(N) \mid y]} = \frac{y}{2} + 2(k - 1) - \frac{p}{2} + \psi_k(y)$$

where $|\psi_k(y)| \le d_k / y$ for some finite positive constant $d_k$. This allows us to evaluate the right hand side of the previous display as

$$\int \beta^k\,\tilde R(d\beta \mid \alpha) = 2^k E\left[ \frac{E(w_k(N) \mid Y)}{E(w_{k-1}(N) \mid Y)} \cdots \frac{E(w_1(N) \mid Y)}{E(w_0(N) \mid Y)} \,\Big|\, \alpha \right] .$$

Furthermore, $E[\psi_k(Y) \mid \alpha] = O(\alpha^{-1})$ by Proposition A.8 of Eaton et alia [11]. Note that the third moment of a non-central $\chi^2_p(\alpha)$ is $\alpha^3 + O(\alpha^2)$ and the fourth moment is

$$\alpha^4 + 24\alpha^3 + 4p\,\alpha^3 + O(\alpha^2) .$$

Combining these results leads to (41).
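The leading coefficients 8, 24, and 48 in (39), (40), and (41) can be traced with exact polynomial arithmetic. The sketch below is an illustration, not the argument of Eaton et alia: it drops the remainder terms $\psi_j$ (an assumption made here purely to expose the leading behavior), so that the $k$-th moment is approximated by $E[\prod_{j=1}^{k} (Y + 4(j-1) - p)]$ with $Y \sim \chi^2_p(\alpha)$. Raw moments of $Y$ are obtained from the standard cumulants $\kappa_n = 2^{n-1}(n-1)!\,(p + n\alpha)$; the helper names are hypothetical.

```python
from math import comb, factorial

def poly_add(a, b):
    """Add polynomials in alpha given as coefficient lists [c0, c1, ...]."""
    n = max(len(a), len(b))
    return [(a[i] if i < len(a) else 0) + (b[i] if i < len(b) else 0) for i in range(n)]

def poly_mul(a, b):
    """Multiply polynomials given as coefficient lists."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def raw_moments(p, kmax):
    """E[Y^n] for n = 0..kmax, Y ~ noncentral chi-square(p, alpha), as polynomials in alpha."""
    # n-th cumulant: 2^(n-1) * (n-1)! * (p + n*alpha), stored as [constant, linear].
    kappa = {n: [2 ** (n - 1) * factorial(n - 1) * p, 2 ** (n - 1) * factorial(n - 1) * n]
             for n in range(1, kmax + 1)}
    m = [[1]]
    for n in range(1, kmax + 1):
        total = [0]
        for k in range(1, n + 1):
            # Moment-cumulant recursion: m_n = sum_k C(n-1, k-1) * kappa_k * m_{n-k}.
            term = [comb(n - 1, k - 1) * c for c in poly_mul(kappa[k], m[n - k])]
            total = poly_add(total, term)
        m.append(total)
    return m

def leading_moment(p, k):
    """E[prod_{j=1..k} (Y + 4(j-1) - p)] as a polynomial in alpha (psi_j terms dropped)."""
    m = raw_moments(p, k)
    in_y = [1]                                   # polynomial in Y
    for j in range(1, k + 1):
        in_y = poly_mul(in_y, [4 * (j - 1) - p, 1])
    result = [0]
    for i, coeff in enumerate(in_y):
        result = poly_add(result, [coeff * c for c in m[i]])
    return result

for k in (2, 3, 4):
    print(k, leading_moment(5, k))
```

In each printed list the last entry is the coefficient of $\alpha^k$ and the one before it is the coefficient of $\alpha^{k-1}$; the dependence on $p$ cancels in that second coefficient, which is why the constants in (39) to (41) do not involve $p$.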

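We now begin to find expressions for the transitional moments $m_k$ in terms of equations (38), (39), (40), and (41) as $\alpha$ becomes large. First, note that if $0 < c < \infty$, then

$$c \int (\beta - \alpha)^2\,\tilde R(d\beta \mid \alpha) = O(\alpha) \quad \text{and} \quad c \int (\beta - \alpha)^3\,\tilde R(d\beta \mid \alpha) = O(\alpha^2) .$$

Express the first transitional moment as

$$\begin{aligned}
m_1(\alpha) &= \frac{\int (\beta - \alpha)(\beta + \alpha)\,\tilde R(d\beta \mid \alpha) + c \int (\beta - \alpha)\,\tilde R(d\beta \mid \alpha)}{\int (\beta + \alpha + c)\,\tilde R(d\beta \mid \alpha)} \\
&= \frac{\int \beta^2\,\tilde R(d\beta \mid \alpha) - \alpha^2 + c \int \beta\,\tilde R(d\beta \mid \alpha) - c\alpha}{\int (\beta + \alpha + c)\,\tilde R(d\beta \mid \alpha)} \\
&= \frac{8\alpha + O(1)}{\int (\beta + \alpha + c)\,\tilde R(d\beta \mid \alpha)} .
\end{aligned}$$

Since $(\beta - \alpha)^2(\beta + \alpha) = \beta^3 - \beta^2\alpha - \beta\alpha^2 + \alpha^3$, we have that

$$\begin{aligned}
\int (\beta - \alpha)^2 (\beta + \alpha)\,\tilde R(d\beta \mid \alpha) &= \alpha^3 + 24\alpha^2 + O(\alpha) - \alpha^3 - 8\alpha^2 - O(\alpha) - \alpha^3 - O(\alpha) + \alpha^3 \\
&= 16\alpha^2 + O(\alpha) .
\end{aligned}$$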
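Express the second transitional moment as

$$\begin{aligned}
m_2(\alpha) &= \frac{\int (\beta - \alpha)^2 (\beta + \alpha)\,\tilde R(d\beta \mid \alpha) + c \int (\beta - \alpha)^2\,\tilde R(d\beta \mid \alpha)}{\int (\beta + \alpha + c)\,\tilde R(d\beta \mid \alpha)} \\
&= \frac{16\alpha^2 + O(\alpha)}{\int (\beta + \alpha + c)\,\tilde R(d\beta \mid \alpha)} .
\end{aligned}$$

Since $(\beta - \alpha)^3(\beta + \alpha) = \beta^4 - 2\beta^3\alpha + 2\beta\alpha^3 - \alpha^4$, we have that

$$\begin{aligned}
\int (\beta - \alpha)^3 (\beta + \alpha)\,\tilde R(d\beta \mid \alpha) &= \alpha^4 + 48\alpha^3 + O(\alpha^2) - 2\alpha^4 - 48\alpha^3 - O(\alpha^2) + 2\alpha^4 + O(\alpha^2) - \alpha^4 \\
&= O(\alpha^2) .
\end{aligned}$$

Since our expressions for the transitional moments all share a common denominator, the ratio $m_3(\alpha)/m_2(\alpha)$ may be evaluated as

$$\begin{aligned}
\frac{m_3(\alpha)}{m_2(\alpha)} &= \frac{\int (\beta - \alpha)^3 (\beta + \alpha)\,\tilde R(d\beta \mid \alpha) + c \int (\beta - \alpha)^3\,\tilde R(d\beta \mid \alpha)}{\int (\beta - \alpha)^2 (\beta + \alpha)\,\tilde R(d\beta \mid \alpha) + c \int (\beta - \alpha)^2\,\tilde R(d\beta \mid \alpha)} \\
&= \frac{O(\alpha^2)}{16\alpha^2 + O(\alpha)} \\
&= O(1) .
\end{aligned}$$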
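The bookkeeping in these expansions is easy to check mechanically. The sketch below verifies the two polynomial identities on an integer grid and then substitutes only the leading terms of (38) to (41), with all remainder terms $\phi_k$ set to zero (an assumption made for illustration), to confirm that the $\alpha^3$ and $\alpha^4$ contributions cancel as claimed. The function names are hypothetical.

```python
def identities_hold(limit=6):
    """Check the two expansions of (b - a)^k (b + a), k = 2, 3, on an integer grid."""
    for a in range(-limit, limit + 1):
        for b in range(-limit, limit + 1):
            if (b - a) ** 2 * (b + a) != b ** 3 - b ** 2 * a - b * a ** 2 + a ** 3:
                return False
            if (b - a) ** 3 * (b + a) != b ** 4 - 2 * b ** 3 * a + 2 * b * a ** 3 - a ** 4:
                return False
    return True

def second_numerator(M1, M2, M3, a):
    """Integral of (b - a)^2 (b + a) expressed through the moments M_k = int b^k R(db | a)."""
    return M3 - a * M2 - a * a * M1 + a ** 3

def third_numerator(M1, M3, M4, a):
    """Integral of (b - a)^3 (b + a) expressed through the moments M_k."""
    return M4 - 2 * a * M3 + 2 * a ** 3 * M1 - a ** 4

a = 10 ** 3
# Leading terms only; the remainders phi_k are deliberately dropped in this sketch.
M1, M2, M3, M4 = a, a ** 2 + 8 * a, a ** 3 + 24 * a ** 2, a ** 4 + 48 * a ** 3

print(identities_hold())
print(second_numerator(M1, M2, M3, a) == 16 * a ** 2)
print(third_numerator(M1, M3, M4, a))
```

With the remainders dropped, the second numerator is exactly $16\alpha^2$ and the third numerator vanishes, so the $O(\alpha^2)$ size of the third numerator comes entirely from the remainder terms.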